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Abstract 



We study the dynamics of information (or virus) dissemination by 
c/2 m mobile agents performing independent random walks on an n-node 

grid. We formulate our results in terms of two scenarios: broadcast- 
ing and gossiping. In the broadcasting scenario, the mobile agents are 
Cn initially placed uniformly at random among the grid nodes. At time 

0, one agent is informed of a rumor and starts a random walk. When 
an informed agent meets an uninformed agent, the latter becomes in- 
\^S formed and starts a new random walk. We study the broadcasting 

y—{ time of the system, that is, the time it takes for all agents to know 

r — the rumor. In the gossiping scenario, each agent is given a distinct 

f^ rumor at time and all agents start random walks. When two agents 

^^ meet, they share all rumors they are aware of. We study the gossiping 

^~~^ time of the system, that is, the time it takes for all agents to know 

^ all rumors. We prove that both the broadcasting and the gossiping 

times are O (ri/-y/m) w.h.p., thus achieving a tight characterization up 

to logarithmic factors. Previous results for the grid provided bounds 

J^ which were weaker and only concerned average times. In the context of 

virus infection, a corollary of our results is that static and dynamically 
moving agents are infected at about the same speed. 



'Support for the first three authors was provided, in part, by MIUR of Italy un- 
der project AlgoDEEP, and by the University of Padova under the Strategic Project 
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1 Introduction 

The dynamics of multiple random walks moving in a common domain pro- 
vides an attractive combinatorial framework for studying diffusion processes 
such as rumor spreading and virus infection. In this work we consider two 
related scenarios on an n-node square grid (or torus) G„. 
Broadcasting scenario: m agents are initially placed uniformly and inde- 
pendently at random in the nodes of G„. At time one agent is informed of 
a rumor and starts a random walk. When an informed agent meets an unin- 
formed agent at a node, the latter becomes informed of the rumor and starts 
its own random walk. How long does it take until all agents are informed 
(broadcasting time)? 

Gossiping scenario: m mobile agents start independent random walks 
from m nodes of Gn chosen uniformly and independently at random. At 
time each agent has a distinct rumor. When two agents meet, each receives 
a copy of all rumors carried by the other agent. How long does it take until 
all of the m agents have received all of the m rumors (gossiping time)? 

We prove that both the broadcasting and the gossiping times are 
Q [nj \fm) with high probability, thus achieving a tight characterization (up 
to polylogarithmic factors in n) of the complexity of rumor spreading in 
both scenarios. In the context of virus infection, a corollary of our results is 
that static agents placed at random locations and dynamic agents moving 
as independent random walks are infected at about the same speed. 

Although at a first glance the two models look similar, we do not have a 
simple reduction between them, hence we develop separate upper and lower 
bound proofs for each scenario. While the general approach employed in 
both cases is similar, there are some important differences in addressing the 
time dependencies, since in the first scenario random walks hit static agents, 
while in the second, random walks of informed agents need to collide with 
the random walks of the uninformed ones. 

It is not hard to verify a [r? jm^ tight bound for the broadcasting and 
gossiping times on an n-node line or ring. Combined with our results, this 
yields a G ((n^/m)^'^) bound for n node d-dimensional grids, with A = 1,2. 
It remains open whether this relation generalizes to finite grids of higher 
dimensions. 



1.1 Related work 

Information dissemination has been extensively studied in the hterature un- 
der a variety of scenarios and objectives. Due to space hmitations, we restrict 
our attention to the results more directly related to our work. 

A prolific line of research has addressed broadcasting and gossiping in 
static graphs, where the nodes of the graph represent active entities which 
exchange messages along incident edges according to specific protocols (e.g., 
push, pull, push-pull). The most recent results in this area relate the perfor- 
mance of the protocols to expansion properties of the underlying topology, 
with particular attention to the case of social networks, where broadcast- 
ing is often referred to as rumor spreading [6]. (For a relatively recent, 
comprehensive survey on this subject, see [12].) 

With the advent of mobile ad-hoc networks there has been growing in- 
terest in studying information dissemination in dynamic scenarios, where 
a number of agents move either in a continuous space or along the nodes 
of some underlying graph and exchange information when their positions 
satisfy a specified proximity constraint. In [7, 8] the authors study the 
time it takes to broadcast information from one of m mobile agents to all 
others. The agents move on a square grid of n nodes and in each time 
step, an agent can (a) exchange information with all agents at distance at 
most R from it, and (b) move to any random node at distance at most p 
from its current position. The results in these papers only apply to a very 
dense scenario where the number of agents is linear in the number of grid 
nodes (i.e., m = Q (n)). They show that the broadcasting time is {y/n/R) 
w.h.p., when p = O {R) and R = Q (ylogn) [7], and it is O {{\^/p) + logn) 
w.h.p., when p = Q (max{i?, -v/logn}) [8]. These results crucially rely on 
R + p = 0. (ylogn) , which implies that the range of agents' communications 
or movements at each step defines a connected graph. 

In more realistic scenarios, like the ones adopted in this paper, the num- 
ber of agents is decoupled from the number of locations (i.e., the graph 
nodes) and a smoother dynamics is enforced by limiting agents to move 
only between neighboring nodes. A reasonable setting consists of a set of 
multiple, simple random walks on a graph, one for each agent, with commu- 
nication between two agents occurring when they meet at the same node. 
One variant of this setting is the so-called Frog Model (corresponding to 
our Broadcasting scenario), where initially one of m agents is active (i.e., 
is performing a random walk), while the remaining agents do not move. 
Whenever an active agent hits an inactive one, the latter is activated and 
starts its own random walk. This model was mostly studied in the infinite 



grid focusing on the asymptotic (in time) shape of the set of vertices con- 
taining all active agents [3, 13]. A model similar to our Gossiping scenario 
is often employed to model the spreading of computer viruses in networks 
and the spreading time is also referred to as infection time. In [9], the au- 
thors provide a general bound on the average infection time when m agents 
(one of them initially affected by the virus) move in an n-node graph. For 
general graphs, this bound is 0(i*logm), where t* denotes the maximum 
average meeting time of two random walks on the graph, and the maximum 
is taken over all pairs of starting locations of the random walks. Also, in 
the paper tighter bounds are provided for the complete graph and for ex- 
panders. Observe that the O {t* log m) bound specializes to O (n log n log m) 
for the n-node grid by applying the known bound on t* of [1]. A tight bound 
of (n log n log m/m) on the infection time on the grid is claimed in [17], 
based on a rather informal argument where some unwarranted independence 
assumptions are made. Our results show that this latter bound is incorrect. 
Finally, a related line of research deals with the cover time of a random 
walk on a graph, that is, the expected time when all of the graph nodes are 
touched by the random walk. (See [1] for a comprehensive account of the 
relevant literature.) The cover time is strictly related to the hitting time 
[4] , namely the average time required of a random walk to reach a specified 
node. For n-node meshes, it is known that the hitting time is O(nlogn), 
while the cover time is O (n log n) [18, 5]. Bounds on the speed-up achieved 
on the cover time by multiple random walks as opposed to a single one are 
proved in [2, 10]. 

1.2 Organization of the paper 

The rest of the paper is organized as follows. In Section 2, we define the 
problem of interest in the two scenarios and establish some technical facts 
which are used in the analysis. Section 3 and Section 4 contain our results 
for the Broadcasting and Gossiping scenario, respectively. 

2 Preliminaries 

We study the dynamics of multiple independent random walks moving on an 
n-node 2-dimensional square grid G„ = {Vn,En), where Vn = {ii,j) \ 1 < 
i,j < \/n}, and En = {{{ij),{i,j + 1)) | 1 < i < ^/n,l < j < ^/n} U 
{{{i,j), {i + 1, j)) I 1 < i < ^/n, 1 < J < V^}- We also add self loops to the 
boundary nodes {{l,j),{^/n,j),{i,l),{i,^/n) \ 1 < i,j < ^/n} so to avoid 
bipartiteness and equalize the steady state distribution. We remark that all 



results in the paper can be immediately ported to the torus graph, where 
wrap-around edges substitute the self-loops. 

Although the two scenarios are defined with respect to a fixed number 
m of agents distributed uniformly and independently at random among the 
nodes of G„ (the exact model), technically, when deriving the upper bounds 
to broadcasting and gossiping times, it is easier to work with a slightly 
modified model in which each node v originally holds m„ agents, where rriv 
is distributed as a binomial variable B{m, 1/n) independently of the other 
nodes (the binomial m,odel). We denote by p{n) = m/n the density of the 
binomial model, and by m the random variable denoting the number of 
agents in a given instance of the model. For a sufficiently large p{n), we 
have m = Q (m) with high probability: 

Lemma 1. Let p{n) = m/n > (171ogn)/n, then, with probability 1 — l/n^ , 
the number of agents in the system, fh, satisfies 2np{n) < rh < |nj)(n). 

Proof. The number of agents is a binomial random variable with expecta- 
tion m = 17 log n. Applying a Chernoff-Hoeffding bound to their number, 
yielding 

1 \ 1^ 1 



Pr ( Im — ml > -m 1 < 2e s™ < 
V ' - 2 y - - n2' 

for a sufficiently large n. D 

Observe that a high-probability result for the binomial model with 
p(n) = m/n implies a similar high-probability result in the exact model 
with m agents (see [15, Corollary 5.9]). 

Moving agents follow independent, simple, symmetric random walks, 
that is, at each step an agent moves to a neighbor of its current location, 
chosen uniformly at random. Moreover, time is discrete, and moves are 
synchronized. For a given rumor r, we say that an agent is informed (of 
r) if it has a copy of r, otherwise it is uninformed (of r). In both the 
broadcasting and gossiping scenarios defined in the introduction, whenever 
an agent informed of r meets another agent uninformed of r, the latter 
becomes informed of r. 

Definition 1 (broadcasting time, gossiping time). In the Broadcasting sce- 
nario, the broadcasting time T-q is the first time at which all agents are 
informed of the single rumor. In the Gossiping scenario, the gossiping time 
Tq is the first time at which all agents are informed of all rumors. 

We denote by ||x — y|| the Manhattan (Li) distance between nodes x 
and y of Gn- Our analysis uses the following two technical lemmas which 



characterize the set of nodes visited by a random walk within a given time 
interval. 

Lemma 2. Consider a random walk on Gn, starting at time t = at node 
vq. There exists a positive constant ci such that for any v ^ vq, 

Pr (v is visited within (\\v — vo\\) steps) > p — ^ — -r. tt-t-. 

^ ^ max{l,log(||t>-wo||)} 

Proof. The Lemma is proven in [3, Theorem 2.2] for the infinite grid Z^. By 
the "Reflection Principle" [11, Page 72], for each walk in Z^ that started in 
Gn, crossed a boundary and then crossed the boundary back to G„, there is 
a walk with the same probability that does not cross the boundary and visits 
all the nodes in Gn that were visited by the first walk. Thus, restricting the 
walks to Gn can only change the bound by a constant factor. D 

Lemma 3. Consider the first i steps of a random walk in Gn which was at 
node Vq at time 0. 

1. The probability that at any given step 1 < i < i the random walk is at 
distance at least > X\/£ from vq is at most 2e~^ '^. 

2. There is a constant C2 such that, with probability greater than 1/2, by 
time i the walk has visited at least C2^/log£ distinct nodes in Gn- 

Proof. We observe that the distance from vq in each coordinate defines a 
martingale with bounded difference 1. Then, the first property follows from 
the Azuma-Hoeffding Inequality [15, Theorem 2.6]. 

As for the second property, let Ri be the set of nodes reached by the 
walk in i steps. By Lemma 2, E (Ri) = ^ (i/logi) (even when vq is near a 
boundary), while Var (i?^) = Q (^^/log £) (see [16]). The result follows by 
applying Chebyshev's inequality. D 



3 The Broadcasting scenario 

In the following two subsections we derive upper and lower bounds on Tb, 
the first time at which all agents are informed of the single rumor. 

3.1 An upper bound on the broadcasting time 

Since the cover time of G„ is O (nlog^ n) (see [18, 5]), we can easily achieve 
an O (n/y/m) upper bound on Tb, with high probability, when m is poly loga- 
rithmic in n. Therefore, in what follows we focus on the case m = Q (log' n) . 



To ease the analysis, it is convenient to envision the spreading process 
as partitioned into three successive phases (see Figure 1; the value ii and 
the constants q, q' will be set by the analysis): 

Phase I (Initial diffusion) The source agent informs a set Ai of at least 
q log n agents, all with origins in a square Bi of side length 8^1 log ' n. 

Phase II (Covering Bi) All agents originally placed in Bi are informed. 

Phase III (Covering Gn) All remaining uninformed agents are informed. 
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Figure 1: Broadcasting scenario. In the figure, at the beginning the source walks 
for 5 intervals of length l^ before reaching point vq at time t = 5£^. We imagine 
that in the interval starting at f the source visits fl (^^/logi?i) distinct nodes, each 
inside the square Bi of side £2 ■ The figure also shows the tessellation with side £3 
adopted when analyzing the covering of G„ . 

The following lemma bounds from above the completion time of Phase I. 

Lemma 4. Let £1 = \/(4qlogn)/(c2p(n)), for constants C2 > (defined in 
Lemma 3) and q > 0. Let Ti = Slflogn. With high probability, at time Ti 
there is a set Ai of informed agents such that: 

1- \Ai\ > glog^n; 

2. for each pair of agents ai , 02 £ Ai, their initial positions at t = are 
within distance 8ii log ' n. 



Proof. Partition the Ti time steps into 31ogn disjoint intervals of length £f, 
and consider the path of the source agent during one such interval. Let vq 
be the location of the agent at the beginning of the interval. 

By Lemma 3, with probability 2e~^°s " < 1/n^ all the nodes visited 
by the walk are within distance Mi log ' n from vq, and thus within dis- 
tance 8^1 log ' n of one other. Moreover, the same lemma implies that with 
probability > 1/2 at least C2^f/log£i distinct nodes are visited. 

Conditioning on visiting these many nodes in the interval, the expected 
number of agents informed by the walk is C2(^f/log£i)p(n) > 2g'log n. 
Applying the Chernoff bound, with probability at least 1 — 1/n, the number 
of agents informed by the path is at least ^log n. 

Thus, with probability at least 1/2 — 2/n one segment of the path satisfies 
the claim, hence the probability that the claim holds for one of the 3 log n 
segments is at least 1 — 1/n for n large enough. D 

Consider now an arbitrary square area Bi with side £2 = 8ii log ' n, 
containing all initial locations of the agents of the set Ai of Lemma 4. Next 
lemma bounds the time taken to complete Phase II. 

Lemma 5. Let T2 = Ad^. With high probability, every agent initially located 
in Bi at t = is informed by time T2 = Ti + T2 . 

Proof. We show that every node in Bi is visited by at least one agent in Ai 
before time T2. 

There are £2 nodes in Bi, and the diameter of Bi is at most 2^2- Applying 
Lemma 2 and the fact that the |^i| walks are independent, the probability 
that any node in Bi is not visited by some agent in Ai until time T2 is at 
most 

.9/ ci \l^il / cifllog^n \ 1 

^2 l-r^^ <exp - '^ ^ +2 log £2 <-, 

V log(2£2)/ V log(2^2) J n 

by selecting a suitably large value for the positive constant q. D 

To analyze the completion time of Phase III, we consider a tessellation of 

Gn into cells of side £3 = y q' log n/p{n), for some suitable constant q' > 0, 
such that one of the cells is entirely contained in Bi . We show that once all 
agents placed in one cell are informed, then with high probability all agents 
placed in the adjacent cells are informed within 20£| steps. 

Lemma 6. Consider a cell C , and let r be the first time when all the agents 
originating in C are informed. With probability greater than 1 — l/n^ , all 



agents originating in the (at most) 4 cells adjacent to C are informed at 
time r + 20£|. 

Proof. By the Chernoff bound, at least i'^p{n)/2 = q' \og njl agents orig- 
inate from a cell C, with probability at least 1 — l/2n^. The maximum 
distance between a node in C and a node in an adjacent cell is bounded by 
3^3. Thus, the probability that any node in the adjacent cells is not visited 
by one of the agents placed in C within 20£| steps after time r is bounded 
by 



log(3£3)y -''''^V 21og(3^3) 



4^1 ( 1 - TTZT^ ) < exp ( - ;?„:;:/: + 2iog2^3 1 < 



by selecting a suitably large value for the constant (( . D 

We are now ready to prove the main result of the subsection: 
Theorem 1. With high probability, 



Proof. The case m = O (log n) immediately follows from the observa- 
tion made at the beginning of the subsection. Consider now the case 
m = Q (log n) . 

With probability 1 — 1/n, Phase I terminates in (f logn steps (Lemma 4) 
and with probability 1 — 1/n, Phase II terminates in 4^2 steps (Lemma 5). 

In Phase III, with probability 1 — 1/n^, all agents in a given cell are 
informed no later than 20^1 steps after all agents in one of its adjacent cells 
are informed. Since the "cell distance" (number of cells) between any two 
cells is at most 2-^/n/^3, Phase III terminates in (2y^/-f3)20^3 = 40^3-y/n 
steps with probability at least 1 — 1/n. 

Putting it all together, we obtain that, with probability at least 1 — 
0(l/n), 

Tb < 3il logn + All + AOlaVn = O I ^^°^ ^ ] . q 



3.2 A lower bound on the broadcasting time 

We develop a lower bound of fi {nj ^/m) to the broadcasting time. The 
argument relies on the intuition that many agents are initially located at 
a large distance from one another: this limits the rate at which the rumor 
may spread. We need the following definition (see also Figure 2): 



Definition 2 (Island). Let A he the set of agents. For any parameter 7 > 0, 
let Gti^) be the graph with vertex set A and such that there is an edge between 
two vertices iff the corresponding agents are within distance 7 at time t. The 
island of parameter 7 of an agent a at time t, denoted by It{a;j), is the 
connected component of Gt{'j) containing a. 
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Figure 2: A partition of agents into islands with 7 = 3. 



In this section, we only consider the islands at time t = 0. Islands at 
t > are used for the lower bound on the gossiping time in the next section. 
We first prove that with high probability there are no big islands. 

Lemma 7. Let 7 = ^Jn/{Ae^m) and write A = VJ'^- -^Ij as the disjoint union 
of s islands of parameter "f . Then, with high probability, \Ij\ < logn, for all 
1 < J < s. 

Proof. Let Bk denote the event that there exists an island with at least k 
agents. It is easy to see that Pr(;Bfc) is upper bounded by the probability 
that Gti'y) contains a tree of k vertices of A as a subgraph. Since k'^~'^ is the 
number of unrooted labeled trees on k nodes, and 47^ /n is (an upper bound 
to) the probability that a given agent lies within distance 7 from another 
given agent, we have that 



Pr (Bk) < 



,k-2 



AY 
n 



fc-i 



< 



e\A\ 



k 



k-2 



AY 
n 



fc-i 



10 



Since 1^41 = m, setting k = 1 + logn and substituting the definition of 7, we 

obtain 

„ ,^ , e\A\ ofh A\ em 11 „ 

Pr (Bk) < 4;^e^2{fe-i) < < _. n 

/c^ k^ n^ n 

Since the agents are distributed uniformly and independently at random, 
with high probability there exists an agent placed at distance Vt (y/n) from 
the source of the rumor: 

Lemma 8. With probability 1 — 1/n^, at time at least one agent is placed 
at Li distance > y/n/2 from the source agent. 

Proof. Given node u of a grid, there are up to 4i nodes at distance i from 
V, so there are up to n/2 nodes at distance < \/n/2 from the source agent. 
The expected number of agents outside this area is np[n)/2 = Q (log n), 
thus the probability that no agent is placed at distance at least v^/2 from 
the source agent is bounded by 1/n^. D 

We are now ready to prove the lower bound on the broadcasting time: 
Theorem 2. With high probability, 



mlog n 



Proof. Let vq denote the agent placed at distance at least v^/2, whose 
existence is guaranteed with probability 1 — 1/n^ by Lemma 8. 

By setting the parameter 7 as in Lemma 7, with high probability no 
island has more than log n agents, hence the maximum distance between two 
agents in the same island is at most 7(log n— 1). Thus, with high probability, 
the rumor must traverse at least H = ^/n/{2J{logn — 1)) islands to reach 
node vq. 

Hence, for H — 1 non-overlapping time periods, the rumor must cover 
the inter-island distance which is at least 7. By applying Lemma 3, 
the probability that one inter-island distance be covered in at most r = 
(l/(24e^))n/(mlogn) steps is at most 1/n^, therefore, with probability 
at least 1 — 1/n the time to spread the rumor is at least {H — l)r = 
il (n/(-^/mlog^ n)). D 

4 The Gossiping scenario 

The Gossiping scenario differs from the Broadcasting scenario in that each 
agent has initially a distinct rumor to spread, and all agents perform inde- 
pendent random walks starting at time 0. 

11 



4.1 The meeting probability of two random walks 

The main new ingredient in analyzing the Gossiping scenario, which is also 
a result of independent interest, is a lower bound on the probability that two 
random walks on the grid meet within a given time interval. The following 
lemma is the analogous of Lemma 2 for the case of two walks. The proof, 
however, requires a different argument. 

Lemma 9. Consider two independent, simple random walks a and b, start- 
ing at time at node oq and bo, respectively, with oq ^ b^. Let at and bt 
be the locations of the walks at time t and /et T > ||ao — 6o|p. Then, there 
exists a constant C3 > such that 

p. ^(T) = Pr (3t < T such that at = bt) > ^^ 



max{l,log(||ao -6o||)}' 

Proof. The case ||ao — 6o|| = 1 is immediate. Consider now the case ||ao — 
60 1 1 > 1- Let Pt{w,x) denote he probability that a walk that started at 
node w at time is at node x at time t, and let R{w, u, s) be the expected 
number of times that two walks which started at nodes w and u at time 
meet during the time interval [0, s], then 



R{w,u,s) = y^ y^ Pt{w, x)Pt{u, x). 



t=o X 
Let T{a, b) be the first meeting time of the walks a and b. Then 

T 

R{ao,bo,T) = y^Pr (r(a, 6) = t)R{xt,xt,T - t) < P^i(T) max R{x,x,T). 

t=0 

Thus, setting Tq = ||ao — 6o|p, we have 

Let D{aQ, bo) be the set of nodes with distance up to ||ao~&o|| from both 
oq and bo, i.e., 

D{ao, bo) = {x I ||x — ooll < ||ao — 6o|| and ||a: — 6o|| ^ ||«o — &o||}- 



12 



It is easy to verify that \D{aQ,bo)\ > \\\ao — 6o|P- Applying Theorem 1.2.1 
in [14] we have: 

To 

R{ao,bo,To) > X] X] Pt{ao,x)Pt{bo,x) 

t=0 x£D{ao,bo) 

^0 / -, \ 2 



Z-^ Z-^ \ TT-I 



i=S)+ia:eD(ao,6o) 

By bounding ||x — ao|p and ||a; — 6o|P from above with Tq in the formula, 
easy calculations show that i?(ao,6o)^o) = ^(1)- Similarly, using the fact 
that there are no more than 4i nodes at distance exactly i from x, we have: 



maxR{x,x,T) < l + ^^4i4 



2 T 




,jr*H2 (e-l)2 

We conclude that there is a constant C3 > such that 

^ fi(ao,6o,ro) ^ C3 

"^•^^ ^ - max^R{x,x,To) " log(||ao - 6o||) ' 

4.2 An upper bound on the gossiping time 

We observe that since the Li diameter of G„ is 2^Jn — 2, we can use Lemma 9 
to show that with probability 1 — l/n^, at time 8nlog n an agent has met 
all other agents walking in G„. Thus, the theorem trivially holds for m 
polylogarithmic in n. 

Consider now the case m = ^ (log n) . As in the derivation of the upper 
bound to the Broadcasting time, we resort to the binomial model introduced 
in Section 2. Let fh be the number of agents in a given instance of the process 
and recall that m = (m) w.h.p. 

13 



We prove the theorem by bounding from above the spreading time of 
any fixed rumor. Specifically, let Tg be the time for spreading of the i-th 
rumor. We show that there is a constant C4 > such that, for 1 < i < m, 

\ \nn ] n^ 



The argument proceeds as follows. We tessellate G„ into cells of suitable 
side l\ (defined in Lemma 10) and say that a cell Q is reached at time tg if 
Iq is the first time when a node of the cell hosts an agent informed of the 
rumor. We call this first visitor the explorer of Q, and let Aq denote the 
set of agents inside Q at time tq . 

We first show that, after a suitably chosen number of steps after tq, cell 
Q is conquered, in the sense that a large number of agents of Aq have been 
informed. Also, we show that by the time cell Q is conquered, each of its 
neighboring cells has been reached, and that a large number of informed 
agents are within a short distance from Q. These facts imply that the 
conquering process proceeds smoothly. Finally, we prove that all agents are 
informed of the rumor shortly after all cells are conquered. This argument 
is made rigorous in the following sequence of lemmas. 

Lemma 10. Let li = ^Mg^nlog n/(c3m), where q > and C3 is defined in 

Lemma 9. Let also Ti = Ai\. Then, for any cell Q of the tessellation, by 
time Ti = tq+Ti, at least qlog^ n agents of Aq are informed with probability 
1 — 1/n^ , for sufficiently large n. 

Proof. Since at any given time the agents are at random and independent 
locations, the expected number of agents in Q at time tq is 4glog n/c^- 
Thus, by the Chernoff bound at least 4glog n/(2c3) agents are inside Q at 
time tQ with probability at least 1 — 1/n^. 

For each agent a G Aq , define an indicator variable Xa such that Xa = 1 
iff agent a has met the explorer of Q by time ri. The XaS are independent 
and X = ^„g^ Xa is a lower bound to the number of informed agents 
in Aq by time ri. Applying Lemma 9, we have Pr (AT^ = 1) > io%(_ > 
hence E (A) > 2q log n. Using the Chernoff bound, we finally obtain 
Pr (a < qlog n) < exp (— glog n/4) < 1/n^, for a sufficiently large n. D 

Lemma 11. Consider a cell Q, and define R to be the square of side £2 = 
(1 + 2\/2)ii centered at Q (see Figure 3). Let ri and Ti be defined as 
in Lemma 10, and let T2 = M'^. Let Ar C Aq denote the set of agents 
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residing inside Q at time tq and contained in R at time ti. Finally, let 
T2 = Ti + T2 = tq + Ti + T2. Then, with probability at least 1 — 1/n^, 

1. at least ^(1 — fj^log n informed agents are within distance £2 from 
Q at time Ti; 

2. 1(1 - l)'-^ < \An\ < 1(1 - f)^^; 

3. every agent a G Aji is informed by time T2; and 

4. each of the neighboring cells of Q has been reached by time T2- 

Proof. Observe that by Lemma 3, the probabihty that an agent a contained 
in Q at time tq is outside R at time ti = tq + Ti is at most 2/e, whence 
Pr (a G Aji\a G Aq) > (l — |) = r. Since (by Lemma 10) at least (7 log n 
agents of Aq are informed at time ri, and each of them belongs to Aji in- 
dependently with probability r, we can apply the Chernoff-Hoeffding bound 
to show that with probability 1 — 1/n^ the number of informed agents in R 
at time n is at least Ir{ti) = §(7 log n. This proves Point (1), and Point 
(2) follows by a similar argument. 

For what concerns Point (3), consider an agent a G A/j, not informed at 
time Ti. Since a is within distance 2^2 from the informed nodes belonging 
to A/j, by Lemma 9, the probability that a is not informed at time T2 is at 
most 

C3 V"^"^^ . (_cMti)\ ^ l^ 

\0g2i2) -''''''V log2^2 y n6 

by selecting a suitably large constant q. The point is proven by applying 
the union bound on \Ar\ = O (log n) agents. 

Finally, for Point (4), consider one of the neighboring cells of Q, say 
Q' . Since Q' C i?, each point of Q is at distance at most 2^2 from the 
actual location of the Ir{ti) informed agents which are inside R at time ti. 
Therefore, applying Lemma 2, we can conclude that the probability that Q' 
is not reached by time T2 is at most 

ci \ l'3'l'^«(^i) / d rq\og^ n Aqlog^ n\ 1 



^ log 2^2/ <exp|^ ^^^2^^ 2 C3m/nj^n6' 

for a sufficiently large n. We conclude the proof by applying the union 
bound over the neighboring cells. D 

A cell Q is conquered when all agents in the set Ar, defined in the 
previous lemma, are informed. 

15 















— 




— 




v/24 
h H 


R 




_ 


— \ 




1 Q 












i 


h: 








j 




























4 




















t 


^2 


' 















/n/£i cells ■ 



Figure 3: Gossiping scenario. The tessellation adopted in the analysis of the 
spreading of a rumor. A cell Q and the corresponding area R around it are shown. 

Lemma 12. With the same notation of Lemma 11, with probability 1 — 1/rfi, 
for any time instant T2 < t < 8n log n, there are at least Q. (log n) informed 
agents at distance at most £2 from any node of Q. 

Proof. Consider an agent a G Ar and let Iqi^t) denote the set of informed 
agents inside Q at time t> T2- 

By Lemma 11, the number Nji{ti) of agents inside R at time ri is at 
least \Aii\ = Q (log^ n). Since each of the agents of Ar was inside cell Q at 
time tQ, the probability r' that a is still inside Q at time T2 is (^1/^2) = 
@ (1). Since the agents perform independent random walks, we can apply a 
concentration bound and get that 



|/q(t2)| > Ir'NRin) > ^ 



, Aq log^ n 

r 

C3 



with probability greater than 1 — 1/n^ for a sufficiently large n. Next, observe 
that the probability that one of the agents of Iq{t2) is still inside R (i.e., is 
within distance £2 from Q) at time T3 = T2 + Ti + T2 is at least r = (1 — |) 
by Lemma 3. Therefore, applying again the concentration bound, we have 
that, with probability greater than 1 — 1/n^ (for a sufficiently large n), there 
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are at least 



/ 2 



NR{t)> -—qlog-'n 

4C2 



informed nodes within distance £2 from Q in each of the time instants T2 < 

t<T3. 

Now consider the time interval [t2 , 8n log n] and partition it into con- 
secutive, non-overlapping subintervals of length Ti + T2. Let tk = T2 + {k — 
l)(Ti + T2) denote the beginning of the k-th subinterval. For any subinterval 
k, we show that (a) \lQ{tk)\ > 1 and (b) at least Q (log n) informed agents 
are inside R during the k-th subinterval. Note that, by (1) and (2), the 
two properties hold for the first subinterval [t2, T3) with probability at least 
1 - 1/n^ 

The key idea for extending the properties to A; > 1 is that we can pick an 
arbitrary informed agent a inside Q at time tk and make it play the role of 
the explorer of Q. More precisely, by mimicking the analyses of Lemmas 10 
and 11, provided that we replace tg with tk, we can show that the presence 
of the explorer ensures the existence of Q (log n) informed agents at the 
beginning of the next interval. This observation, combined with (1) and (2), 
proves that if properties (a)-(b) hold for the k-th subinterval, they will hold 
also for the (A; + l)-th subinterval. 

Applying the union bound over O (nlog rij subintervals concludes the 
proof. n 

We are now ready to prove the main theorem of this subsection: 

Theorem 3. With high probability, 



Proof. As observed at the beginning of the subsection, we can limit ourselves 
to the case m = Q (log rij . Consider first the spreading of the i-th rumor, 
and tessellate the domain with cells of side ii, specified in Lemma 10. 

Consider a cell Q reached for the first time at time tg. By Lemma 11, 
we know that each of the neighboring cells of Q is reached within time 
T2 = tg + Ti + T2 with probability 1 — 1/n^. Therefore, all cells are reached 
within time {2^/n/il){Tl + T2) with probability 1 — 1/n^. 

For any agent a, let ta be the first time when the agent is inside a 
cell that is either reached or conquered at time ta, and observe that ta < 
{2^/n/il+l){Tl-\-T2) with probability 1 — 1/n^. Using the result of Lemma 12 
about the number of informed agents near the cell where agent a resides at 
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time ta, an argument similar to the one used in the proof of Point (3) of 
Lemma 11 shows that a wih be informed by time ta + T2 with probabihty 
1 - l/n^. 

Putting it aU together, by setting C4 = 64(5 + 1\/2)^JqJoi^ we have 
proved that the broadcasting time Tg of the i-th rumor satisfies 

Pr T^ > ^-L < ^. 



The theorem foUows by observing that the broadcasting time of a rumor 
does not depend on the origins of the other rumors, so we can take the 
union bound over aU m = O (n) different rumors and conclude that Tq = 
O (nlog ' nj^frnA with probability 1 — 1/n. D 



4.3 A lower bound on the gossiping time 

In the Gossiping scenario uninformed agents move, hence we need to resort 
to a lower bound argument which takes such movements into account. We 
define the informed area X{t) at time t as the set of places visited by an 
informed agent up to time t. The frontier of X(t) is the border separating 
the informed area from the remaining places. To simplify the exposition, we 
imagine that the rumor travels from left to right and consider the grid node 
a;(t) of the rightmost informed agent at time t. By definition, the informed 
area lies to the left of x{t) and we need to show that there is a sufficiently 
large value T such that, at time T, there is at least one uniformed agent 
right of x{T). 

First, we can prove that, with probability 1 — 1/ra^, at each time instant 
< i < 8n log n, every island of parameter 7 = y^n/{Ae^m) has no more 
than logn agents. The proof is similar to the argument used in Lemma 7 
for the broadcasting scenario: 

Lemma 13. Let 7 = Y^n/(4e^m). Then, the probability that there exists an 
island of parameter 7 in any time instant < t < 8n log n with more than 
logn agents is at most 1/n^. 

Proof. Since at any given time the agents are uniformly distributed in Gn, 
the probability that a given agent is within distance 7 of another given agent 
at time to is bounded by 47^/n. Fix a time instant to and let Bk{to) denote 
the event that there exists an island with at least k > log n elements at time 
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to- Then, 



'"^ i*-^- (^jL\'"' < ('^•\'-i,-'^ /'fi!' '"' 



Using definition of 7 and the bound /c > 1 + log n and m < n, we have 



T-, rr, 1^ \\ ^ ^'^ ~^{k~\\ ^ en 1 1 



/c^ A;^ n^ n^ 

for a sufficiently large n. Applying the union bound on O (nlog n) time 
instants concludes the proof. D 

Next we show that, with high probability, the frontier of the informed 
area cannot advance too fast. 

Lemma 14. Let 7 = Y^n/(4e^m) and lei to o-nd ti = to + 7^/(36 log n) 6e 
two time steps. Then, with probability 1 — 1/n^, 

||x(ti)-x(to)|| < (7logn)/2. 

Proof. Consider the island / = /(y(ao;7) of the informed agent ao located 
at node a; (to) at time to. 

By Lemma 3, with probability 1 — 2/n^ an agent cannot cover a distance 
of more than 7/2 in 7^/(36 log n) time steps. Thus, with probability 1 — 1/n^, 
up to time ti the rumor cannot propagate (directly or through intermediate 
agents) outside the set of agents /, and the farthest it can get from x(to) by 
members of / is bounded by (7logn)/2 since the island has no more than 
logn agents by Lemma 13. 

Then, ||x(ti) — x(to)|| < (7logn)/2 with probability 1 — 1/n^. D 

Finally, we can prove the main theorem of the subsection: 
Theorem 4. With high probability, 

/mlog n 



Proof. By Lemma 8, with high probability there exists an agent a whose 
distance at time from the source was at least ^fnjl. 

Let T = n/(144e^-y/mlog n) and 7 = Y^n/(4e^m). By Lemma 14, with 
probability 1 — 1/n the frontier cannot move right in T steps more than 
(7logn/2)r/(7V(361ogn)) = Vn/4. 
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By Lemma 3, with probability 1 — 2/n^, agent a cannot move left more 
than 2^/Tlogn < ^JuJAl and thus that agent is not informed at time T . 
Hence, the gossiping time is at least Tq > T = Q, (^n/{^/mlog n)) with 
probability 1 — 1/n. D 

Acknowledgment: Many thanks to Jeff Steif for referring us to some 
crucial references. 
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